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OM protein - protein search, using sw model 
Run on: June 15, 2005, 10:19:12 ; Search time 178.268 Seconds
(without alignments)
846.121 Million cell updates/sec

Title: US-10-053-975A-1
Perfect score: 2047
Sequence: 1 MAVSESQLKKMVSKYKYRDL FQLRALMQKARKTAGLSDLY 390

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 2105692 seqs, 386760381 residues

Total number of hits satisfying chosen parameters: 2105692

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 100 summaries

Database : A_Geneseq_16Dec04:*
1: geneseqp1980s:*
2: geneseqp1990s:*
3: geneseqp2000s:*
4: geneseqp2001s:*
5: geneseqp2002s:*
6: geneseqp2003as:*
7: geneseqp2003bs:*
8: geneseqp2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
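The throughput figure in the run header appears to be the number of dynamic-programming cells (query residues times database residues) divided by the search time. This is an inference from the reported numbers, not a documented GenCore formula; the function name and the dictionary layout below are illustrative only. A minimal sketch that recomputes the figure for this run:

```python
def cell_updates_per_second(query_len, db_residues, seconds):
    """Assumed definition: DP cells (query x database residues) per second."""
    return query_len * db_residues / seconds


QUERY_LENGTH = 390  # length of US-10-053-975A-1, from the header

# database name -> (residues searched, search time in s, reported Million cell updates/sec)
runs = {
    "A_Geneseq_16Dec04": (386760381, 178.268, 846.121),
}

for name, (residues, seconds, reported) in runs.items():
    computed = cell_updates_per_second(QUERY_LENGTH, residues, seconds) / 1e6
    print(f"{name}: computed {computed:.3f}, reported {reported:.3f} Million cell updates/sec")
```

The computed and reported values agree to within rounding of the printed search time, which supports the assumed definition without proving it.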

SUMMARIES 
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ALIGNMENTS 



RESULT 1 
AAW93424 

ID AAW93424 standard; protein; 390 AA.
XX
AC AAW93424;
XX
DT 11-JUN-1999 (first entry)
XX
DE Human NHTS protein.
XX
KW Human tumour suppressor; NHTS; treatment; diagnosis; lymphoma; cancer;
KW brain; breast; colon; heart; kidney; ovary; paraganglia; pancreas;
KW prostate; skin; stomach; thyroid; autoimmune disease; asthma; diabetes;
KW biliary cirrhosis; Crohn's disease; rheumatoid arthritis.
XX
OS Homo sapiens.
XX
PN US5892016-A.
XX
PD 06-APR-1999.
XX
PF 23-JAN-1997; 97US-00786999.
XX
PR 23-JAN-1997; 97US-00786999.
XX
PA (INCY-) INCYTE PHARM.
XX
PI Brie SL, Goli SK;
XX
DR WPI; 1999-253932/21.
DR N-PSDB; AAX23168.
XX
PT Novel human tumor suppressor - useful for the diagnosis or treatment of
PT lymphoma, cancer, and autoimmune disease.
XX
PS Claim 1; Fig 1A-B; 25pp; English.
XX
CC This invention describes the isolation of a novel human tumour suppressor
CC (NHTS). The products of the invention may be used for the diagnosis or
CC treatment of conditions and diseases which are associated with expression
CC of NHTS e.g. lymphoma and cancers of the brain, breast, colon, heart,
CC kidney, ovary, paraganglia, pancreas, prostate, skin, stomach and thyroid
CC and autoimmune disease e.g. asthma, biliary cirrhosis, Crohn's disease,
CC diabetes, and rheumatoid arthritis.
XX
SQ Sequence 390 AA;



Query Match 100.0%; Score 2047; DB 2; Length 390; 

Best Local Similarity 100.0%; Pred. No. 7e-149; 

Matches 390; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60

Qy  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120

Qy 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180

Qy 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240

Qy 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300

Qy 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360

Qy 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390
       ||||||||||||||||||||||||||||||
Db 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390



us-10-053-975a-1.rai



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: June 15, 2005, 10:33:17 ; Search time 46.8487 Seconds
(without alignments)
621.429 Million cell updates/sec

Title: US-10-053-975A-1
Perfect score: 2047
Sequence: 1 MAVSESQLKKMVSKYKYRDL FQLRALMQKARKTAGLSDLY 390

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 513545 seqs, 74649064 residues

Total number of hits satisfying chosen parameters: 513545

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 100 summaries

Database : Issued_Patents_AA:*
1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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US-09-886-319A-82 Sequence 82, Appl 

US-08-999-774A-8 Sequence 8, Appli 

US-09-270-767-43042 Sequence 43042, A 

US-09-513-999C-5770 Sequence 5770, Ap 

US-09-270-767-58371 Sequence 58371, A 

US-09-248-796A-16547 Sequence 16547, A

US-08-526-136-13 Sequence 13, Appl 

US-09-949-016-7070 Sequence 7070, Ap 

US-09-949-016-7071 Sequence 7071, Ap 

US-09-461-325-453 Sequence 453, App 

US-10-012-542-453 Sequence 453, App 

US-10-115-123-453 Sequence 453, App 

US-09-949-016-7068 Sequence 7068, Ap 

US-09-949-016-7069 Sequence 7069, Ap 

US-09-949-016-7621 Sequence 7621, Ap 

US-08-574-959A-9 Sequence 9, Appli 

US-09-357-014-9 Sequence 9, Appli 

US-08-574-959A-7 Sequence 7, Appli 

US-09-357-014-7 Sequence 7, Appli 

US-09-949-016-7289 Sequence 7289, Ap 

US-09-248-796A-14568 Sequence 14568, A

US-09-535-008-63 Sequence 63, Appl 

US-09-535-008-61 Sequence 61, Appl 

US-09-535-008-67 Sequence 67, Appl 

US-09-535-008-2 Sequence 2, Appli 

US-09-824-574-4 Sequence 4, Appli 

US-09-538-092-1172 Sequence 1172, Ap 

US-09-535-008-75 Sequence 75, Appl 

US-09-535-008-71 Sequence 71, Appl 

US-09-949-016-9752 Sequence 9752, Ap 

US-09-535-008-69 Sequence 69, Appl 

US-09-535-008-65 Sequence 65, Appl 

US-09-535-008-77 Sequence 77, Appl 

US-09-535-008-73 Sequence 73, Appl

US-09-266-225D-18 Sequence 18, Appl 

US-09-041-886-23 Sequence 23, Appl 

US-09-538-092-1209 Sequence 1209, Ap 

US-09-800-729-138 Sequence 138, App 

US-09-800-729-106 Sequence 106, App 

US-09-800-729-199 Sequence 199, App 

US-08-714-741-32 Sequence 32, Appl 

US-08-642-255-32 Sequence 32, Appl 

US-07-609-716-65 Sequence 65, Appl 

US-08-475-411A-65 Sequence 65, Appl 

US-08-478-029A-65 Sequence 65, Appl 

US-08-728-323A-2 Sequence 2, Appli 

US-09-298-568-2 Sequence 2, Appli 

US-09-410-399-2 Sequence 2, Appli 

US-09-894-273-2 Sequence 2, Appli 

US-09-095-443-2 Sequence 2, Appli

US-09-270-767-46446 Sequence 46446, A 

US-09-026-587-4 Sequence 4, Appli 

US-09-227-420-4 Sequence 4, Appli 

US-09-387-811-4 Sequence 4, Appli 

US-09-823-240A-10 Sequence 10, Appl 

US-09-949-016-7291 Sequence 7291, Ap 

US-09-417-197-125 Sequence 125, App 

US-09-080-897-2 Sequence 2, Appli 

US-09-323-735-2 Sequence 2, Appli 

US-08-899-595-3 Sequence 3, Appli 

US-09-902-540-13993 Sequence 13993, A 

US-09-949-016-7961 Sequence 7961, Ap

US-09-949-016-8706 Sequence 8706, Ap 
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81 120 5.9 777 4 US-09-949-016-8707 Sequence 8707, Ap 

82 119.5 5.8 503 3 US-08-526-136-2 Sequence 2, Appli
83 119.5 5.8 505 3 US-08-526-136-4 Sequence 4, Appli
84 118.5 5.8 397 4 US-09-248-796A-18882 Sequence 18882, A
85 118.5 5.8 1255 2 US-09-080-897-4 Sequence 4, Appli
86 118.5 5.8 1255 3 US-08-899-595-1 Sequence 1, Appli
87 118.5 5.8 1255 3 US-09-323-735-4 Sequence 4, Appli
88 118 5.8 214 1 US-08-217-327-4 Sequence 4, Appli
89 118 5.8 581 4 US-09-949-016-9978 Sequence 9978, Ap
90 117 5.7 416 4 US-09-690-454-136 Sequence 136, App
91 117 5.7 779 4 US-10-164-595-56 Sequence 56, Appl
92 117 5.7 843 4 US-10-164-595-54 Sequence 54, Appl
93 116.5 5.7 338 1 US-08-218-686-2 Sequence 2, Appli
94 116.5 5.7 338 3 US-08-460-242-2 Sequence 2, Appli
95 116 5.7 1461 4 US-09-585-887-9 Sequence 9, Appli
96 116 5.7 1461 4 US-09-289-578-9 Sequence 9, Appli
97 116 5.7 1464 4 US-09-331-347C-21 Sequence 21, Appl
98 115.5 5.6 357 1 US-07-609-716-66 Sequence 66, Appl
99 115.5 5.6 357 1 US-08-642-255-33 Sequence 33, Appl
100 115.5 5.6 357 3 US-08-475-411A-66 Sequence 66, Appl



ALIGNMENTS 



RESULT 1 
US-08-786-999-1 

; Sequence 1, Application US/08786999
; Patent No. 5892016
; GENERAL INFORMATION:
; APPLICANT: La Brie, Sam
; APPLICANT: Goli, Surya K.
; TITLE OF INVENTION: NOVEL HUMAN TUMOR
; TITLE OF INVENTION: SUPPRESSOR
; NUMBER OF SEQUENCES: 3
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: Incyte Pharmaceuticals, Inc.
; STREET: 3174 Porter Drive
; CITY: Palo Alto
; STATE: CA
; COUNTRY: USA
; ZIP: 94304
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Diskette
; COMPUTER: IBM Compatible
; OPERATING SYSTEM: DOS
; SOFTWARE: FastSEQ for Windows Version 2.0
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/08/786,999
; FILING DATE: Filed Herewith
; CLASSIFICATION: 530
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER:
; FILING DATE:
; ATTORNEY/AGENT INFORMATION:
; NAME: Billings, Lucy J.
; REGISTRATION NUMBER: 36,749
; REFERENCE/DOCKET NUMBER: PF-0199 US
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: 415-855-0555
; TELEFAX: 415-845-4166

; INFORMATION FOR SEQ ID NO: 1: 

; SEQUENCE CHARACTERISTICS: 
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LENGTH: 390 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

IMMEDIATE SOURCE: 
LIBRARY: ColnnotOl 
CLONE: 609476 
US-08-786-999-1 

Query Match 100.0%; Score 2047; DB 2; Length 390; 

Best Local Similarity 100.0%; Pred. No. 7.3e-159; 

Matches 390; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60

Qy  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120

Qy 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180

Qy 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240

Qy 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300

Qy 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360

Qy 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390
       ||||||||||||||||||||||||||||||
Db 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390



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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model



Run on: June 15, 2005, 10:46:54 ; Search time 160.016 Seconds
(without alignments)
934.287 Million cell updates/sec

Title: US-10-053-975A-1
Perfect score: 2047
Sequence: 1 MAVSESQLKKMVSKYKYRDL FQLRALMQKARKTAGLSDLY 390

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 1710399 seqs, 383334425 residues

Total number of hits satisfying chosen parameters: 1710399

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 100 summaries

Database : Published_Applications_AA:*
1: /cgn2_6/ptodata/1/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/1/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/1/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/1/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/1/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/1/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/1/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/1/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/1/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/1/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/1/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/1/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/1/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/1/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/1/pubpaa/US10D_PUBCOMB.pep:*
17: /cgn2_6/ptodata/1/pubpaa/US10E_PUBCOMB.pep:*
18: /cgn2_6/ptodata/1/pubpaa/US10_NEW_PUB.pep:*
19: /cgn2_6/ptodata/1/pubpaa/US11A_PUBCOMB.pep:*
20: /cgn2_6/ptodata/1/pubpaa/US11_NEW_PUB.pep:*
21: /cgn2_6/ptodata/1/pubpaa/US60_NEW_PUB.pep:*
22: /cgn2_6/ptodata/1/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

Result         Query %
No.    Score   Match    Length  DB  ID                 Description
1      2047    100.0    390     9   US-09-886-319A-2   Sequence 2, Appli
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10.6 
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466 
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466 
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148.5 


7.3 
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148 
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176 
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483 
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142 


6.9 


505 
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142 


6.9 


505 


40 


142 


6.9 


505 


41 


142 


6.9 


505 


42 


142 


6.9 


983 


43 


142 


6.9 


1021 


44 


142 


6.9 


1251 


45 


139.5 


6.8 


258 


46 


139.5


6.8 


278 


47 


138.5


6.8 


827 


48 


136 


6.6 


1066 


49 


135.5 


6.6 


180 


50 


134 


6.5 


1613 


51 
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6.5 


336 


52 


132 


6.4 


371 


53 


132 
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371 


54 


132 


6.4 


371 


55 


132 


6.4 


371 


56 


132 


6.4 


371 


57 


132 


6.4 


371 


58 


131.5 


6.4 


605 


59 


131.5 


6.4 


889 


60 


131 


6.4 


101 
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101 


62 


131 
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518 


63 


131 
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588 


64 
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6.4 


873 
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14 US-10-243-815A-1 

14 US-10-053-975A-1 

14 US-10-376-564-2 

16 US-10-675-979-1

9 US-09-804-690-4 

9 US-09-886-319A-1 

14 US-10-205-194-109 

14 US-10-376-564-1 

9 US-09-804-690-2 

14 US-10-243-815A-3 

9 US-09-886-319A-82 

14 US-10-376-564-82 

15 US-10-264-049-2861 

16 US-10-437-963-164064 
14 US-10-029-386-28907 

9 US-09-925-299-1257 

10 US-09-925-299-1257 

14 US-10-029-386-29129 
16 US-10-425-115-299224 
16 US-10-425-115-295478 

15 US-10-424-599-239306 

16 US-10-425-115-298252 
9 US-09-801-368-386 

15 US-10-424-599-283387 

17 US-10-734-049A-223 

9 US-09-925-300-1664 

16 US-10-408-765A-377 
14 US-10-012-542-453 
14 US-10-115-123-453 

14 US-10-106-698-5906 

15 US-10-094-749-2983 

17 US-10-927-644-1 

16 US-10-437-963-191498 

15 US-10-264-049-2971 

16 US-10-437-963-106668 
16 US-10-739-930-7139 
16 US-10-408-765A-539 
16 US-10-719-993-624 

16 US-10-719-993-626 

17 US-10-952-459-10 

15 US-10-112-944-435 

16 US-10-408-765A-313 
15 US-10-112-944-881 

15 US-10-424-599-176931 

15 US-10-425-114-68284 

15 US-10-149-310-68 

16 US-10-437-963-157156 

16 US-10-437-963-191496 

17 US-10-732-923-8825 
16 US-10-437-963-201912 

10 US-09-284-320-2 
14 US-10-197-666A-4 
14 US-10-024-298A-41 

14 US-10-042-211A-41 

15 US-10-617-217A-41 

16 US-10-024-298A-41 

16 US-10-437-963-102875 

16 US-10-437-963-197045 

16 US-10-739-930-6173 

17 US-10-732-923-2393 

15 US-10-369-493-3240 

16 US-10-437-963-150465 
16 US-10-437-963-142576 
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Sequence 1, Appli 
Sequence 1, Appli 
Sequence 2, Appli 
Sequence 1, Appli 

Sequence 4, Appli 

Sequence 1, Appli 
Sequence 109, App 
Sequence 1, Appli 

Sequence 2, Appli 
Sequence 3, Appli 

Sequence 82, Appl 
Sequence 82, Appl 
Sequence 2861, Ap 
Sequence 164064, 
Sequence 28907, A 

Sequence 1257, Ap 
Sequence 1257, Ap 
Sequence 29129, A 
Sequence 299224, 
Sequence 295478, 
Sequence 239306, 
Sequence 298252, 

Sequence 386, App 
Sequence 283387, 
Sequence 223, App 

Sequence 1664, Ap 
Sequence 377, App 
Sequence 453, App 
Sequence 453, App 
Sequence 5906, Ap 
Sequence 2983, Ap 
Sequence 1, Appli 
Sequence 191498, 
Sequence 2971, Ap 
Sequence 106668, 
Sequence 7139, Ap 
Sequence 539, App 
Sequence 624, App 
Sequence 626, App 
Sequence 10, Appl 
Sequence 435, App 
Sequence 313, App 
Sequence 881, App 
Sequence 176931, 
Sequence 68284, A 
Sequence 68, Appl 
Sequence 157156, 
Sequence 191496, 
Sequence 8825, Ap 
Sequence 201912, 
Sequence 2, Appli 
Sequence 4, Appli 
Sequence 41, Appl 
Sequence 41, Appl 
Sequence 41, Appl 
Sequence 41, Appl 
Sequence 102875, 
Sequence 197045, 
Sequence 6173, Ap 
Sequence 2393, Ap 
Sequence 3240, Ap 
Sequence 150465, 
Sequence 142576, 
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RESULT 1 
US-09-886-319A-2 

; Sequence 2, Application US/09886319A
; Publication No. US20020086019A1
; GENERAL INFORMATION:
; APPLICANT: Wolf, Eckard
; APPLICANT: Werner, Sabine
; APPLICANT: Halle, Dorn-Peter
; APPLICANT: Regenbogen, Johannes
; APPLICANT: Goppelt, Andreas
; TITLE OF INVENTION: Use of Polypeptides or Nucleic Acids for
; TITLE OF INVENTION: the Diagnosis or Treatment of Skin Disorders and Wound
; TITLE OF INVENTION: Healing and for the Identification of Pharmacologically
; TITLE OF INVENTION: Active Substances
; FILE REFERENCE: 50125/014002
; CURRENT APPLICATION NUMBER: US/09/886,319A
; CURRENT FILING DATE: 2001-06-20
; PRIOR APPLICATION NUMBER: US 60/222,081
; PRIOR FILING DATE: 2000-08-01
; PRIOR APPLICATION NUMBER: DE 10030149.5
; PRIOR FILING DATE: 2000-06-20
; NUMBER OF SEQ ID NOS: 84
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us-10-053-975a-l.rapb 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2 
LENGTH: 390 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-886-319A-2 

Query Match 100.0%; Score 2047; DB 9; Length 390; 

Best Local Similarity 100.0%; Pred. No. 3.2e-133; 

Matches 390; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60

Qy  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120

Qy 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180

Qy 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240

Qy 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300

Qy 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360

Qy 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390
       ||||||||||||||||||||||||||||||
Db 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: June 15, 2005, 10:32:03 ; Search time 36.5055 Seconds
(without alignments)
1027.916 Million cell updates/sec

Title: US-10-053-975A-1
Perfect score: 2047
Sequence: 1 MAVSESQLKKMVSKYKYRDL FQLRALMQKARKTAGLSDLY 390

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 283416 seqs, 96216763 residues

Total number of hits satisfying chosen parameters: 283416

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 100 summaries

Database : PIR_79:*
1: pir1:*
2: pir2:*
3: pir3:*
4: pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
H88650 

protein C09G12.9 [imported] - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 10-May-2001 #sequence_revision 10-May-2001 #text_change 09-Jul-2004
C;Accession: H88650
R;anonymous, The C. elegans Sequencing Consortium.
Science 282, 2012-2018, 1998
A;Title: Genome sequence of the nematode C. elegans: a platform for
investigating biology.
A;Reference number: A75000; MUID:99069613; PMID:9851916
A;Note: see websites genome.wustl.edu/gsc/C_elegans/ and
www.sanger.ac.uk/Projects/C_elegans/ for a list of authors
A;Note: published errata appeared in Science 283, 35, 1999; Science 283, 2103,
1999; and Science 285, 1493, 1999
A;Accession: H88650
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-404 <ST0>
A;Cross-references: UNIPROT:O76258; GB:chr_IV; PIDN:AAC25822.1; PID:g3294495;
GSPDB:GN00022; CESP:C09G12.9
C;Genetics:
A;Gene: C09G12.9
A;Map position: 4



Query Match 32.1%; Score 658; DB 2; Length 404; 

Best Local Similarity 36.7%; Pred. No. 1.1e-30;

Matches 151; Conservative 74; Mismatches 121; Indels 66; Gaps 11; 

Qy 16 KYRDLTVRETVWITLYKDLKPVIjDSWFNDGSSRELMNLTGTIPVPYRGNTYNIPICLW 75 

III : : : : : : I I I I I : : : I I I I I I I I I I I : I MM::: 
Db 16 KYADSAKKDIIGALSQFKDLSPGTDTmFPDGKRRTAFRLKGTIPVTYKGACYNIPVTVY 75 

Qy 76 LLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHPQSDLLGLIQVMIWF 135 

I I I : I I I I I : I I I I : I : I I : I I : : I I I I : I I : I I I I I : I : 
Db 76 LWDTH P YYAP I CYVNPTSTM ESEHVNKEGKVFLPYLNEWRFPGYDLSGLLQM 127 

Qy 136 GDEP P VFS RP I SAS YP P YQATGP PNTSYMPGMPGGISPYPSG YP 179 

: I : I : I III I I I I I : I I I : I II 
Db 128 IFARSAANSATNASATNPSAGSSASSTPTPYPSSQPTMP TPYPTGSGAAPYP 179 

Qy 180 PNPSGYPGC PYPPG GPYPATTSSQY PSQPPV TTVG 214 

I : : I I llll I I I : : I : llll hi 

Db 180 PSSTPYPSAGAMGYNPYMNVPQSTPYPMGASGASPYPSASSNPAPPPPRPPPVTAQTSVS 239 

Qy 215 P S RDGT I S EDT I RAS LI SAVS DKLRWRMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMV 274 

I III I I I I I I : : I I I : I : I : : : I I I I : : : : I : : I : : I I 1 I : I : 
Db 240 SSSGGTIQADTIRASVMSAVEEKIRAKLRERMGTNSAEMASIRTTSDELREGQQKLKRML 299 

Qy 275 TRLDQEVAEVDKNI ELLKKKDEELS SALEKMENQSENNDI DEVI I PTAPLYKQI LNLYAE 334 

I : : : : I : I I I : I I : : llll I I : : I I : I I : 

Db 300 EELETQRS S LQTACE I YTAKKAELAKALS D-AGGTDAP P I DEAI DAAFPLHRQI VLN YAK 358 

Qy 335 ENAI EDT I F YLGEALRRGVI DLDVFLKHVRLLS RKQFQLRALMQKARKTAGL 38 6 

: : : I : : II : I : I I I : I I : I I I I I I I I : I I I I 

Db 359 DLTC Q S LKKRQITLAE YLRHVRDVS REQ FI YRATMQKCRRTAGL 402 
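The two percentages printed above each alignment can be reproduced from the other figures on the same lines. The sketch below assumes that "Query Match" is the hit score divided by the perfect score, and that "Best Local Similarity" is the number of matches divided by the total number of alignment columns (matches, conservative substitutions, mismatches and indels). Both definitions are inferred from the reported numbers rather than taken from GenCore documentation, and the function names are illustrative.

```python
def query_match_percent(score, perfect_score):
    """Assumed definition: hit score as a percentage of the query's self-score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches, conservative, mismatches, indels):
    """Assumed definition: identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Figures reported for RESULT 1 of the PIR search (H88650)
print(f"Query Match: {query_match_percent(658, 2047):.1f}%")
print(f"Best Local Similarity: {best_local_similarity_percent(151, 74, 121, 66):.1f}%")
```

Under these assumptions the H88650 hit gives 32.1% and 36.7%, and the self-hits (score 2047, 390 matches, nothing else) give 100.0% for both, in line with the report.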
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: June 15, 2005, 10:20:07 ; Search time 168.534 Seconds
(without alignments)
1184.992 Million cell updates/sec

Title: US-10-053-975A-1
Perfect score: 2047
Sequence: 1 MAVSESQLKKMVSKYKYRDL FQLRALMQKARKTAGLSDLY 390

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 1612378 seqs, 512079187 residues

Total number of hits satisfying chosen parameters: 1612378

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 100 summaries

Database : UniProt_03:*
1: uniprot_sprot:*
2: uniprot_trembl:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result          Query %
No.    Score    Match    Length  DB  ID
1      2047     100.0    390     1   T101_HUMAN
2      2041     99.7     390     2   Q9BUM5
3      1945.5   95.0     391     1   T101_MOUSE
4      1927.5   94.2     391     2   Q6IRE4
5      1908.5   93.2     391     2   Q7TSE5
6      1899     92.8     392     2   Q9I8G8
7      1705.5   83.3     390     2   Q6IQ70
8      1685     82.3     394     2   Q6NUD5
9      1682.5   82.2     395     2   Q6P2Z8
10     1024.5   50.0     408     2   Q9VVA7
11     976      47.7     402     2   Q7Q6B6
12     754.5    36.9     425     2   O76258
13     749.5    36.6     326     2   Q6DDX9
14     599      29.3     249     2   Q8MQZ0
15     522      25.5     114     2   Q7T2M2
16     455      22.2     398     2   Q9LHG8
17     435      21.3     379     2   Q8IX04
18     432      21.1     580     2   Q7S4R9
19     429.5    21.0     249     2   Q7TQD3
20     427      20.9     174     2   O08761
21     415.5    20.3     476     2   Q66KB7



Description 
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Q9lhg8 
Q8ix04 
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Q66kb7 
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drosophila 
anopheles g 
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xenopus lae 
drosophila 
ameiurus ne 
arabidopsis 
homo sapien 
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mus musculu 
mus musculu 
xenopus tro 
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23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 
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53 

54 

55 

56 

57 

58 

59 
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63 

64 
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67 

68 

69 
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74 

75 

76 
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398 
396.5 
391 
388 
371 
371 
341 
303 
301.5 
301.5 
294 
294 
239 
215 
169 
155.5 
154.5 
153.5 
153 
152 
151.5 
148.5 
148.5 
148.5 
148.5 
147.5 
147 
145.5 
144.5 
144.5 
144.5 
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144 
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142 
141.5 
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141.5 
141 
141 
141 
140.5 
139.5 
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139 
138.5 
138.5 
138.5 
138 
137 
136.5 
136 
136 
135.5 
135.5 
135.5 
135.5 
135 
135 
135 
135 
134.5 



19.4 
19.4 
19.1 
19.0 
18.1 
18.1 
16.7 
14.8 
14.7 



14 
14 
14 
11 
10 
8 



7 
4 
4 

7 
5 
3 

7.6 
7.5 
7.5 



7. 
7. 
7. 
7. 
7. 
7. 
7. 
7. 
7. 
7. 
7. 
7. 
7, 
7, 
7. 
7. 
6. 
6. 
6. 
6. 
6, 
6, 
6, 
6, 
6, 
6, 



6.8 
6.8 
6.8 
6.8 



6. 
6. 
6. 
6. 
6. 
6. 
6. 
6. 
6. 
6. 
6. 



6.6 
6.6 
6.6 
6.6 
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471 2 Q6DBY5 Q6dby5 brachydanio 

402 2 Q6ESB7 Q6esb7 oryza sativ
368 2 Q9FFY6 Q9ffy6 arabidopsis
83 2 Q6LBE4 Q6lbe4 mus musculu
177 2 Q96FF5 Q96ff5 homo sapien
341 2 Q9NUX7 Q9nux7 homo sapien
357 2 Q6P2F0 Q6p2f0 homo sapien
385 1 ST22_YEAST P25604 saccharomyc
378 2 Q6C148 Q6c148 yarrowia li
378 2 Q873M7 Q873m7 yarrowia li
111 2 Q8BU96 Q8bu96 mus musculu
468 2 Q6FS29 Q6fs29 candida gla
496 2 Q6BID5 Q6bid5 debaryomyce
445 2 Q75EU1 Q75eu1 ashbya goss
376 2 P78998 P78998 saccharomyc
299 2 Q6CS18 Q6cs18 kluyveromyc
397 2 Q6NW27 Q6nw27 homo sapien
148 2 Q6FIG7 Q6fig7 homo sapien
283 2 Q641Z8 Q641z8 rattus norv
1158 2 Q6GM32 Q6gm32 xenopus lae
485 2 Q804G3 Q804g3 brachydanio
148 2 Q6NXQ6 Q6nxq6 homo sapien
148 2 Q9NZ81 Q9nz81 homo sapien
397 2 Q8NEZ2 Q8nez2 homo sapien
397 2 Q96DL9 Q96dl9 homo sapien
263 1 Y737_DROME P83475 drosophila
397 2 Q8CHS8 Q8chs8 mus musculu
1130 2 Q8IZL8 Q8izl8 homo sapien
743 2 Q9BU60 Q9bu60 homo sapien
1048 2 Q96FT1 Q96ft1 homo sapien
1130 2 Q6NTE6 Q6nte6 homo sapien
483 2 Q7T391 Q7t391 brachydanio
488 1 ANX7_HUMAN P20073 homo sapien
664 2 Q6CDQ5 Q6cdq5 yarrowia li
505 1 ANXB_HUMAN P50995 homo sapien
1021 2 O15451 O15451 homo sapien
284 2 Q6BGN1 Q6bgn1 debaryomyce
503 1 ANXB_MOUSE P97384 mus musculu
503 2 Q921F1 Q921f1 mus musculu
247 2 Q945K9 Q945k9 arabidopsis
463 1 ANX7_MOUSE Q07076 mus musculu
463 2 Q922A2 Q922a2 mus musculu
1167 2 Q640S7 Q640s7 xenopus tro
900 2 Q6PSU8 Q6psu8 arabidopsis
415 2 Q8CCV9 Q8ccv9 mus musculu
1284 2 O15450 O15450 homo sapien
827 2 O60130 O60130 schizosacch
838 2 Q871B7 Q871b7 neurospora
4903 1 MLL3_MOUSE Q8brh4 mus musculu
526 2 Q804G4 Q804g4 brachydanio
463 2 Q8BP75 Q8bp75 mus musculu
503 1 ANXB_RABIT P33477 oryctolagus
137 2 Q9CQD5 Q9cqj5 m mus muscu
1066 2 Q7XE90 Q7xe90 oryza sativ
437 2 Q9STN0 Q9stn0 arabidopsis
895 2 Q9LIW7 Q9liw7 oryza sativ
998 2 Q6GNM2 Q6gnm2 xenopus lae
1105 2 Q8BM34 Q8bmj4 mus musculu
463 2 Q8VIN2 Q8vin2 rattus norv
463 2 Q6IRJ7 Q6irj7 rattus norv
674 2 Q18106 Q18106 caenorhabdi
703 2 Q7SCK8 Q7sck8 neurospora
809 2 Q7ZVN7 Q7zvn7 brachydanio
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6.5 
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086316 
Q7d8kl 
Q9utp6 
Q64lh0 
Q7u0b9 
Q6ddw4 
Q91d31 
Q6nre3 
Q9z502 
Q7z429 
Q94b77 
Q9fiu0
Q81cl8
O59907
Q6m9d4 
Q7rz53 



mycobacteri 

mycobacteri 

schizosacch 

xenopus lae 

mycobacteri 

xenopus lae 

crypthecodi 

xenopus lae 

streptomyce 

homo sapien 

arabidopsis 

arabidopsis 

arabidopsis 

neurospora 

neurospora 

neurospora 



ALIGNMENTS 



RESULT 1
T101_HUMAN

ID T101_HUMAN STANDARD; PRT; 390 AA.

AC Q99816; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Tumor susceptibility gene 101 protein. 

GN Name=TSG101;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta;

RX MEDLINE=97148696; PubMed=9019400; DOI=10.1016/S0092-8674(00)81866-8;

RA Li L., Li X., Francke U., Cohen S.N.;

RT "The TSG101 tumor susceptibility gene is located in chromosome 11 band

RT p15 and is mutated in human breast cancer.";

RL Cell 88:143-154(1997). 

RN [2] 

RP ERRATUM. 

RX MEDLINE=99080983; PubMed=9867424; DOI=10.1016/S0092-8674(00)89342-3;

RA Li L., Francke U., Cohen S.N.; 

RL Cell 93:660-660(1998). 

RN [3] 

RP ALTERNATIVE SPLICING (ISOFORM 2).

RX MEDLINE=98031856; PubMed=9366528; DOI=10.1038/sj.onc.1201591;

RA Gayther S.A., Barski P., Batley S.J., Li L., de Foy K.A., Cohen S.N.,

RA Ponder B.A., Caldas C.;

RT "Aberrant splicing of the TSG101 and FHIT genes occurs frequently in 

RT multiple malignancies and in normal tissues and mimics alterations 

RT previously described in tumours."; 

RL Oncogene 15:2119-2126(1997). 

RN [4] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=97384943; PubMed=9242438;

RA Lee M.P., Feinberg A.P.;

RT "Aberrant splicing but not mutations of TSG101 in human breast 

RT cancer."; 

RL Cancer Res. 57:3131-3134(1997).

RN [5]

RP ALTERNATIVE SPLICING.

RX MEDLINE=99054675; PubMed=9840940; DOI=10.1038/sj.onc.1202529;

RA Wagner K.-U., Dierisseau P., Rucker E.B. III, Robinson G.W.,

RA Hennighausen L.; 

RT "Genomic architecture and transcriptional activation of the mouse and 

RT human tumor susceptibility gene TSG101: common types of shorter 

RT transcripts are true alternative splice variants.";

RL Oncogene 17:2761-2770(1998). 

RN [6] 

RP INTERACTION WITH DMAP1.

RX MEDLINE=20347709; PubMed=10888872; DOI=10.1038/77023;

RA Rountree M.R., Bachman K.E., Baylin S.B.;

RT "DNMT1 binds HDAC2 and a new co-repressor, DMAP1, to form a complex at

RT replication foci."; 

RL Nat. Genet. 25:269-277(2000). 

RN [7] 

RP STRUCTURE BY NMR OF 1-145. 

RX MEDLINE=22001147; PubMed=12006492; DOI=10.1093/emboj/21.10.2397;

RA Pornillos O., Alam S.L., Rich R.L., Myszka D.G., Davis D.R.,

RA Sundquist W.I.;

RT "Structure and functional interactions of the Tsg101 UEV domain.";

RL EMBO J. 21:2397-2406(2002). 

RN [8] 

RP STRUCTURE BY NMR OF 1-145. 

RX MEDLINE=22289677; PubMed=12379843; DOI=10.1038/nsb856;

RA Pornillos O., Alam S.L., Davis D.R., Sundquist W.I.;

RT "Structure of the Tsg101 UEV domain in complex with the PTAP motif of

RT the HIV-1 p6 protein."; 

RL Nat. Struct. Biol. 9:812-817(2002). 

CC -!- FUNCTION: May be involved in cell growth and differentiation. Acts
CC as a negative growth regulator.

CC -!- SUBUNIT: Interacts with ubiquitin, stathmin, GMCL and DMAP1 (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Mainly cytoplasmic. Depending on the stage
CC of the cell cycle, detected in the nucleus.

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2;

CC Comment=Additional isoforms seem to exist. Several shorter

CC isoforms are detected in primary breast cancers and other

CC tumors;

CC Name=1;

CC IsoId=Q99816-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q99816-2; Sequence=VSP_004440;

CC Note=Detected in normal as well as cancer tissues; 

CC -!- TISSUE SPECIFICITY: Heart, brain, placenta, lung, liver, skeletal, 

CC kidney and pancreas.

CC -!- DOMAIN: The UEV domain is required for the interaction of the 
CC complex with ubiquitin. 

CC -!- DOMAIN: The coiled coil domain may interact with stathmin. 

CC -!- SIMILARITY: Contains 1 UEV (ubiquitin E2 variant) domain. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U82130; AAC52083.1; -. 

DR PDB; 1KPP; NMR; A=1-145.
DR PDB; 1KPQ; NMR; A=1-145.
DR PDB; 1M4P; NMR; A=1-145.
DR PDB; 1M4Q; NMR; A=1-145.
DR IntAct; Q99816;
DR Genew; HGNC:15971; TSG101.
DR H-InvDB; HIX0009490;
DR MIM; 601387;
DR GO; GO:0003677; F:DNA binding; TAS.
DR GO; GO:0003714; F:transcription corepressor activity; TAS.
DR GO; GO:0006512; P:ubiquitin cycle; TAS.
DR InterPro; IPR010983; EF_Hand_like.
DR InterPro; IPR008883; Tsg101.
DR InterPro; IPR000608; UBQ_conjugat.
DR Pfam; PF05743; Tsg101; 1.
DR PROSITE; PS00183; UBIQUITIN_CONJUGAT_1; FALSE_NEG.
DR PROSITE; PS50127; UBIQUITIN_CONJUGAT_2; FALSE_NEG.
KW 3D-structure; Alternative splicing; Coiled coil; Growth regulation; 
KW Protein transport; Transport; Ubl conjugation pathway. 
FT DOMAIN 1 133 UEV. 

FT DOMAIN 235 316 Coiled coil (Potential).

FT VARSPLIC 15 119 Missing (in isoform 2).

FT /FTId=VSP_004440.
FT HELIX 4 11
FT HELIX 18 31
FT TURN 33 34
FT STRAND 36 44
FT TURN 45 47
FT STRAND 48 63
FT TURN 64 65
FT STRAND 66 75
FT STRAND 86 88
FT STRAND 103 103
FT TURN 105 106
FT STRAND 108 109
FT HELIX 112 115
FT TURN 116 116
FT TURN 119 120
FT HELIX 124 137

SQ SEQUENCE 390 AA; 43944 MW; ADD6912FC22DF162 CRC64;

Query Match 100.0%; Score 2047; DB 1; Length 390;
Best Local Similarity 100.0%; Pred. No. 2.1e-99;
Matches 390; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1 MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIP 60

Qy  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  61 VPYRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHP 120

Qy 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 QSDLLGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPP 180

Qy 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 NPSGYPGCPYPPGGPYPATTSSQYPSQPPVTTVGPSRDGTISEDTIRASLISAVSDKLRW 240

Qy 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 RMKEEMDRAQAELNALKRTEEDLKKGHQKLEEMVTRLDQEVAEVDKNIELLKKKDEELSS 300

Qy 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 ALEKMENQSENNDIDEVIIPTAPLYKQILNLYAEENAIEDTIFYLGEALRRGVIDLDVFL 360

Qy 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390
       ||||||||||||||||||||||||||||||
Db 361 KHVRLLSRKQFQLRALMQKARKTAGLSDLY 390